Wan2.2-S2V-14B is a Mixture-of-Experts (MoE) model for audio-driven, cinematic-quality video generation. It generates high-quality video from input audio, reference images, and text prompts at 480P or 720P resolution, and it supports complex motion and cinematic aesthetics.